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RAW SEQUENCE LISTING 

PATENT APPLICATION US/08/793,408A 



DATE: 04/09/98 
TIME: 14:45:15 



INPUT SET: S24836.raw 



This Raw Listing contains the General 
Information Section and up to the first 5 pages. 



SEQUENCE LISTING 



(1) 



General Information: 



(i) APPLICANT: Choo, Yen 

Klug, Aaron 

Sanchez Garcia, Isidro 

(ii) TITLE OF INVENTION: Improvements in or Relating to 
Binding Proteins for Recognition of DNA 

(iii) NUMBER OF SEQUENCES: 125 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Pillsbury Madison & Sutro, L.L.P. 

(B) STREET: 1100 New York Avenue, N.W. 

(C) CITY: Washington 

(D) STATE: D.C. 

(E) COUNTRY: USA 

(F) ZIP: 20005-3918 

(V) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Diskette 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Word Perfect 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/793,408 

(B) FILING DATE: 02-JUN-1997 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: PCT/GB95/01949 

(B) FILING DATE: 17-AUQ-1995 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: GB 9514698.1 

(B) FILING DATE: 18-JUL-1995 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: GB 9422534.9 

(B) FILING DATE: 08-NOV-1994 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: GB 9416880.4 
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47 (B) PILING DATE: 20-AUG-1994 
48 

49 (2) INFORMATION FOR SEQ ID NO: 1: 
50 

51 (i) SEQUENCE CHARACTERISTICS: 

52 (A) LENGTH: 60 base pairs 

53 (B) TYPE: nucleic acid 

54 (C) STRANDEDNESS : single 

55 (D) TOPOLOGY : linear 
56 

57 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 
58 

59 CTCCTOCAOT TGGACCTQTG CCATGGCCGG CTGGGCCGCA TAGAATGGAA 50 
60 

61 CAACTAAAGC 60 
62 

63 (2) INFORMATION FOR SEQ ID NO: 2: 
64 

65 (i) SEQUENCE CHARACTERISTICS: 

66 (A) LENGTH: 92 amino acids 

67 (B) TYPE: amino acid 

68 (C) STRANDEDNESS: 

69 (D) TOPOLOGY: unknown 
70 

71 (ii) MOLECULE TYPE: protein 

72 

73 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

74 

75 Met Ala Glu Glu Arg Pro Tyr Ala Cys Pro 

76 5 10 
77 

78 Val Glu Ser Cys Asp Arg Arg Phe Ser Arg 

79 15 20 
80 

81 Ser Asp Glu Leu Thr Arg His lie Arg He 

82 25 30 
83 

84 His Thr Gly Gin Lys Pro Phe Gin Cys Arg 

85 35 40 
86 

87 He Cys Met Arg Asn Phe Ser Xaa Xaa Xaa 

88 45 50 
89 

90 Xaa Leu Xaa Xaa His Xaa Arg Thr His Thr 

91 55 60 
92 

93 Gly Glu Lys Pro Phe Ala Cys Asp He Cys 

94 65 70 
95 

96 Gly Arg Lys Phe Ala Arg Ser Asp Glu Arg 

97 75 80 
98 

99 Lys Arg His Thr Lys He His Leu Arg Gin 
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85 90 

Lys Asp 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
TATGACTTQG ATGGGAGACC GCCTOG 
(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
AATTCCAQGC GGTCTCCCAT CCAAGTCA 
(2) INFORMATION FOR SEQ ID NO : 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
TATATAGCGT GGGCGTATAT A 
(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
GCGTATATAC GCCCACQCTA TATA 
(2) INFORMATION FOR SEQ ID NO : 7: 
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153 

154 (1) SEQUENCE CHARACTERISTICS: 

155 (A) LENGTH: 21 base pairs 

156 (B) TYPE: nucleic acid 

157 (C) STRANDEDNESS : single 

158 (D) TOPOLOGY: linear 
159 

160 (Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
161 

162 TATATAGCGN NNGCGTATAT A 21 
163 

164 (2) INFORMATION FOR SEQ ID NO: 8: 
165 

166 (i) SEQUENCE CHARACTERISTICS: 

167 (A) LENGTH: 24 base pairs 

168 (B) TYPE: nucleic acid 

169 (C) STRANDEDNESS: single 

170 (D) TOPOLOGY: linear 
171 

172 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
173 

174 GCGTATATAC GCNNNCGCTA TATA 24 
175 

176 (2) INFORMATION FOR SEQ ID NO: 9: 
177 

178 (i) SEQUENCE CHARACTERISTICS: 

179 (A) LENGTH: 33 base pairs 

180 (B) TYPE: nucleic acid 

181 (C) STRANDEDNESS: single 

182 (D) TOPOLOGY: linear 
183 

184 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
185 

186 TTCCATGGAG ACGCAGAAGC CCTTCAGCGQ CCA 33 
187 

188 (2) INFORMATION FOR SEQ ID NO: 10: 
189 

190 (i) SEQUENCE CHARACTERISTICS: 

191 (A) LENGTH: 33 base pairs 

192 (B) TYPE: nucleic acid 

193 (C) STRANDEDNESS: single 

194 (D) TOPOLOGY: linear 
195 

196 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
197 

198 TTCCATGGAG ACGCAGGTGA GTTCCTCACG CCA 33 
199 

200 (2) INFORMATION FOR SEQ ID NO: 11: 
201 

202 (i) SEQUENCE CHARACTERISTICS: 

203 (A) LENGTH: 33 base pairs 

204 (B) TYPE: nucleic acid 

205 (C) STRANDEDNESS: single 
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206 (D) TOPOLOGY: linear 

207 

208 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

209 

210 CCCCTTTCTC TTCCAOAAGC CCTTCAGCGQ CCA 33 
211 

212 (2) INFORMATION FOR SEQ ID NO: 12: 
213 

214 (i) SEQUENCE CHARACTERISTICS: 

215 (A) LENGTH : 33 amino acids 

216 (B) TYPE : amino acid 

217 (C) STRANDEDNESS : 

218 (D) TOPOLOGY: unknown 
219 

220 (ii) MOLECULE TYPE : peptide 

221 

222 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

223 

224 Met Ala Glu Qlu Lys Pro Phe Gin Cys Arg 

225 5 10 
226 

227 lie Cys Met Arg Asn Phe Ser Asp Arg Ser 

228 15 20 
229 

2 30 Ser Leu Thr Arg His Thr Arg His Thr Gly 

231 25 30 

232 

2 33 Glu Lys Pro 

234 

2 35 (2) INFORMATION FOR SEQ ID NO: 13: 
236 

2 37 (i) SEQUENCE CHARACTERISTICS: 

238 (A) LENGTH: 33 amino acids 

2 39 (B) TYPE: amino acid 

240 (C) STRANDEDNESS: 

241 (D) TOPOLOGY: unknown 
242 

24 3 (ii) MOLECULE TYPE: peptide 

244 

245 (Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

246 

247 Met Ala Glu Glu Lys Pro Phe Gin Cys Arg 

248 5 10 
249 

250 lie Cys Met Arg Asn Phe Ser Glu Arg Gly 

251 15 20 
252 

253 Thr Leu Ala Arg His Glu Lys His Thr Gly 

254 25 30 
255 

256 Glu Lys Pro 

257 

258 (2) INFORMATION FOR SEQ ID NO: 14: 
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